Dirichlet process model for joint haplotype inference and GWAS

Authors

  • Avinash Das Sahu
  • Sridhar Hannenhalli
Abstract

Identification of causal genomic mutations that underlie disease phenotypes remains a key problem in the field of medical informatics. With the advent of new sequencing technologies and the decreasing cost of human genotyping, it is now possible to study genotype-phenotype associations at the population level, for example through genome-wide association studies (GWAS). However, due to large genomic variance and linkage disequilibrium, the genetic diversity of a complete human population cannot be captured by a limited number of clusters. Furthermore, current haplotype inference (phasing) methods are not reliable when applied to rare genomic variants, such as disease-related alleles. Hence, a satisfactory method for identifying deleterious mutations remains largely elusive. Here we present a non-parametric Bayesian model that jointly infers haplotypes and identifies deleterious mutations, taking into consideration genomic variance in the human population. The model is based on the Dirichlet process, which can capture genomic variance by modeling it with an unbounded number of clusters.
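As an illustration of what an unbounded number of clusters means in practice, the following minimal sketch samples cluster labels from a Chinese restaurant process, a standard sequential view of the Dirichlet process. It is not the authors' model: the function name `crp_assignments`, the concentration parameter `alpha`, and the `seed` argument are assumptions made for this example, and no haplotype or phenotype data are involved.

```python
import random


def crp_assignments(n_items, alpha, seed=0):
    """Sample cluster labels for n_items from a Chinese restaurant process.

    Hypothetical helper for illustration only; alpha (> 0) is the
    concentration parameter of the underlying Dirichlet process.
    """
    rng = random.Random(seed)
    counts = []  # counts[k] = number of items already assigned to cluster k
    labels = []
    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)  # open a new cluster
        else:
            counts[k] += 1
        labels.append(k)
    return labels


if __name__ == "__main__":
    labels = crp_assignments(n_items=1000, alpha=2.0)
    print("clusters used:", len(set(labels)))
```

The number of clusters is not fixed in advance: it tends to grow slowly (roughly logarithmically) as more items are added, which is the property the abstract relies on when it contrasts the Dirichlet process with a limited number of clusters.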

Similar resources

A Nonparametric Bayesian Approach for Haplotype Reconstruction from Single and Multi-Population Data

Uncovering the haplotypes of single nucleotide polymorphisms and their population demography is essential for many biological and medical applications. Methods for haplotype inference developed thus far, including those based on approximate coalescence, finite mixtures, and maximal parsimony, often bypass issues such as unknown complexity of haplotype-space and demographic structures underlying...

A New Mathematical Model for the Problem of Inferring Haplotypes from Genotypes under the Parsimony Criterion

Haplotype inference is one of the most important problems in the field of bioinformatics. Its various applications in the diagnosis and treatment of inherited diseases such as diabetes, Alzheimer's and heart disease have spurred competition among researchers to present mathematical models and design algorithms for solving it. Despite the existence of ...

Accelerating Haplotype-Based Genome-Wide Association Study Using Perfect Phylogeny and Phase-Known Reference Data

The genome-wide association study (GWAS) has become a routine approach for mapping disease risk loci with the advent of large-scale genotyping technologies. Multi-allelic haplotype markers can provide superior power compared with single-SNP markers in mapping disease loci. However, the application of haplotype-based analysis to GWAS is usually bottlenecked by prohibitive time cost for haplotype...

Spectrum: A software for inferring population structure and recombination events

Spectrum is software for joint inference of population structure and recombination events from multi-locus SNP haplotypes. Under a non-parametric Bayesian framework using a hidden Markov Dirichlet process, the genetic inheritance process under mutation and recombination events is inferred. Assuming a number of founder haplotypes, it recovers the association of each individual haplotype with founde...

Hidden Markov Dirichlet Process: Modeling Genetic Inference in Open Ancestral Space

The problem of inferring the population structure, linkage disequilibrium pattern, and chromosomal recombination hotspots from genetic polymorphism data is essential for understanding the origin and characteristics of genome variations, with important applications to the genetic analysis of disease propensities and other complex traits. Statistical genetic methodologies developed so far mostly ...

Volume 6

Publication date: 2012